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Abstract 

For perturbations of integrable Hamiltonians systems, the Nekhoro- 
shev theorem shows that all solutions are stable for an exponentially 
long interval of time, provided the integrable part satisfies a steepness 
condition and the system is analytic. This fundamental result has been 
extended in two distinct directions. The first one is due to Niederman, 
who showed that under the analyticity assumption, the result holds 
true for a prevalent class of integrable systems which is much wider 
than the steep systems. The second one is due to Marco-Sauzin but it 
is limited to quasi-convex integrable systems, for which they showed ex- 
ponential stability if the system is assumed to be only Gevrey regular. 
If the system is finitely differentiable, the author showed polynomial 
stability, still in the quasi-convex case. The goal of this work is to 
generalize all these results in a unified way, by proving exponential or 
polynomial stability for Gevrey or finitely differentiable perturbations 
of prevalent integrable Hamiltonian systems. 



1 Introduction 

1. Consider a near- integrable Hamiltonian system, that is a perturbation 
of an integrable Hamiltonian system, which is of the form 

(H(e,i) = h(i) + f(e,i), 

\\h\ = l, \f\<e«l. 

Here (9, 1) £ T n x R n are angle-action coordinates, and / is a small pertur- 
bation, of size e, in some suitable topology defined by a norm | . | . In the 
absence of perturbation, that is when e is zero, the action variables I{t) are 
integrals of motions and all solutions are quasi-periodic. Therefore it is a 
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natural question, which is in fact motivated by concrete problems of stabil- 
ity in celestial mechanics, to study the evolution of the action variables I(t) 
after perturbation, that is for e > but arbitrarily small. 

2. If the system is analytic and if h satisfies a steepness condition, which 
is a quantitative transversality condition, it is a remarkable result due to 
Nekhoroshev ( [Nek77] . |Nek79| ) that the action variables are stable for an 
exponentially long interval of time with respect to the inverse of the size of 
the perturbation: one has 

\I(t)-I \< Cl e b , \t\ <exp(c 2 £- a ), 

for some positive constants ci,C2,a,b and provided that the size of the per- 
turbation e is smaller than a threshold £q. This is a major result in the 
theory of perturbations of Hamiltonian systems, and it gives complement to 
two previous and equally important results. The first one is that "most" so- 
lutions (in a measure-theoretical sense) are quasi-periodic and hence stable 
for all time, in the sense that 

\I(t) ~ h\ < cVe, t € K, 

for some positive constant c, provided that the system is sufficiently regular 
(analytic, Gevrey or even C k for k > 2n), h satisfies a mild non-degeneracy 
assumption and e is smaller than a threshold eq. This is the content of KAM 
theory, see }Kol54] for the original statement and, for instance [RiisOlj . 
[Sal04] and |Pop04| among the enormous literature, for various improve- 
ments regarding the hypotheses of non-degeneracy and regularity. Hence 
Nekhoroshev estimates give new information for solutions living on the phase 
space not covered by KAM theory, and even though the latter has a small 
measure, it is usually topologically large (at least for n > 3). The second 
important previous result is that there exist "unstable" solutions, satisfying 

\I(t) - J 1 > 1, r = r(e) > 0. 

This was discovered by Arnold in his famous paper [Arn64] . and it has be- 
come widely known as "Arnold diffusion". Even though this phenomenon 
has been intensively studied, very little is known. Here Nekhoroshev esti- 
mates give an exponentially large lower bound on the time of instability r(e), 
explaining (in part) why such instability properties are so hard to detect. 

3. Now returning to Nekhoroshev estimates, the original proof is rather long 
and complicated. It is naturally divided into two parts. The first part, which 
is analytic, is the construction of general resonant normal forms, up to an 
exponentially small remainder (this is where the analyticity of the system is 
used), on local domains of the action space where one has a suitable control 
on the so-called small divisors. Then, the second part, which is geometric, 



2 



consists in the construction of a partition of the action space where one 
can use such normal forms. This is where the steepness of the integrable 
system enters, basically it rules out the existence of solutions which cannot 
be controlled by these normal forms, so eventually the exponentially small 
remainders easily translate into an exponentially long time of stability for 
all solutions. 

4. It has been noticed by the Italian school ([BG G85j . |BG86] ) that us- 
ing preservation of energy, the geometric part of the proof can be simpli- 
fied for the simplest steep Hamiltonians, namely strictly convex or strictly 
quasi-convex Hamiltonians (recall that quasi-convexity means that the en- 
ergy sub- levels are convex subsets). Then, much work have been devoted to 
this special case. In particular, Lochak introduced in |Loc92| a new method 
leading in particular to an extremely simple and elegant proof of these es- 
timates under the quasi-convexity assumption. His approach only relies on 
averaging along periodic frequencies, which enables him to construct special 
resonant normal forms (periodic frequencies are just resonant frequencies 
of codimension-one multiplicities), and the use of the most basic result in 
simultaneous Diophantine approximation, namely Dirichlet's theorem, to 
cover the whole action space by domains where such special normal forms 
can be used. This also brought to light the surprising phenomenon of "sta- 
bilization by resonances", which implies that the more resonant the initial 
condition is, the more stable (in a finite time-scale) the solution will be. This 
method had several applications, and an important one, that we shall be con- 
cerned here, was the extension of these stability estimates for non-analytic 
systems. Indeed, using Lochak's strategy, it was shown in [MS02J that the 
exponential estimates are also satisfied for Gevrey regular systems, and in 
[BoulOb] it was proved that polynomial estimates hold true if the system is 
only of finite differentiability (this is obviously the best one can expect under 
such a weak regularity assumption). Let us point out that for non-analytic 
systems, the construction of these normal forms, which is the only new in- 
gredient one has to add since the geometric part of the proof is insensible 
to the regularity of the system, is more difficult than for an analytic system 
(this is especially true for Gevrey systems). Indeed, one cannot simply work 
with C°-norms (on some complex strip, then by the usual Cauchy estimates 
one has a control on all derivatives on smaller complex strips), so one has to 
work directly with all the derivatives (and moreover keep a control on the 
growth of these derivatives in the Gevrey case). In [MSQ2] and |BoulObj . 
such normal forms were constructed but only because it was enough to con- 
sider periodic frequencies, which boils down to what is classically known as 
a one-phase averaging. 

5. However, all these results were restricted by the quasi-convexity hypoth- 
esis. A study of Nekhoroshev estimates under more general assumptions 
on the integrable part has been initiated by Niederman. First, in [Nie04| . 
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he introduced new geometric arguments based on simultaneous Diophantine 
approximation, leading to a great simplification in the geometric part of 
the proof of Nekhoroshev's result under the steepness condition. Then, in 
|Nie07| . he realized that this method allows in fact to obtain the result for a 
much wider class of unperturbed Hamiltonians, which he called "Diophan- 
tine steep", and which are prevalent (in the sense of Hunt, Sauer and York, 
recall that prevalence is a possible generalization of full Lebesgue measure 
for infinite dimensional linear spaces). However, the analytic part of Nie- 
derman's proof was still based on averaging along general frequencies and 
hence required the construction of general resonant normal forms (which 
was taken from |Pos93j ). In fact, in the non-convex case, a simple averag- 
ing along a periodic frequency, which corresponds to studying the dynamics 
in the neighbourhood of a resonance of co dimension-one multiplicity, can- 
not be enough since solutions will necessarily explore resonances associated 
to different, and possibly all, multiplicities. But then in |BN10j we were 
able to construct normal norms associated to any multiplicities by making 
suitable composition of periodic averagings, with periodic vectors which are 
independent and sufficiently close to each other. This was an extension of 
Lochak's method, in the sense that no small divisors were involved, only (a 
composition of) periodic averagings and simultaneous Diophantine approx- 
imation were used. This proof was not only simpler than the previous one, 
but also opened the way to several applications. For instance, in [BoulOaJ, 
it was shown how one can easily obtain more general results of stability in 
the vicinity of linearly stable quasi-periodic invariant tori. 

6. The aim of this paper is to extend the above results by proving stability 
estimates for Hamiltonian systems with a prevalent integrable part, but 
which are not necessarily analytic. In the Gevrey case, this will lead to 
exponential estimates of stability for perturbation of a generic integrable 
Hamiltonian, as stated below. 

Theorem 1.1. For a > 1, consider an arbitrary a-Gevrey integrable Hamil- 
tonian h defined on an open ball in W 1 . Then for almost any £ 6 W 1 , the 
integrable Hamiltonian h^(x) = h(I) — £.1 is exponentially stable. 

This will be a direct consequence of Theorems 12.21 and Theorem 12.41 
below. This result generalizes the main results of [NieOT] and jBNIOj which 
were restricted by the analyticity assumption (a = 1), and the main stability 
result of [MS02] which was restricted by the quasi-convexity assumption (our 
condition on the integrable part is much more general than quasi-convexity) . 
In the finitely differentiable case, we will obtain polynomial estimates of 
stability for perturbation of a generic integrable Hamiltonian. 

Theorem 1.2. For k > 2n + 2, consider an arbitrary C k integrable Hamil- 
tonian h defined on an open ball in W 1 . Then for almost any £ G W 1 , the 
integrable Hamiltonian h^(x) = h(I) — £.1 is polynomially stable. 
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Once again, this will be a direct consequence of Theorems 12.21 and The- 
orem 12.51 below, and the above result extends the main result of [BoulOb] 
which was only valid for quasi-convex integrable systems. 

2 Main results 

In order to state our results, we now describe our setting more precisely. We 
let Br = B(0,R) be the open ball of W 1 , centered at the origin, of radius 
R > with respect to the supremum norm | . | . Our phase space will be the 
domain V R = T n x B R . 

1. Let us first explain our prevalent condition on the unperturbed Hamil- 
tonian h, which comes from [BN10]. Let G{n,k) be the Grassmannian of 
all vector subspaces of M n of dimension k. We equip M n with the Euclidean 
scalar product, || . || stands for the Euclidean norm, and given an integer 
L G N*, we define G L (n, k) as the subset of G(n, k) consisting of those sub- 
spaces whose orthogonal complement can be spanned by vectors k G Z n \{0} 
with \k\i < L, where | . |i is the £ 1 -norm. 

Definition 2.1. A function h G C 2 {Br) is said to be Diophantine Morse if 
there exist 7 > and r > such that for any L G N*, any k G {1, . . . , n} 
and any A G G L (n,k), there exists (e±, . . . , e^) (resp. (/1, . . . , f n -k) )> an 
orthonormal basis of A (resp. of A- 1 -), such that the function h^ defined on 
Br by 

h\(a, P) = h (aiei H h a k e k + H h Pn-kfn-k) , 

satisfies the following: for any (a, /?) G Br, 

||5 a /i A (a,/3)|| < 7L~ r =^ \\d aa h A (a,P).r]\\ > 7L~ T ||??|| 

for any r/ G R n \ {0}. 

In other words, for any (a,/?) G Br, we have the following alterna- 
tive: either \\d a h\(a, > r )L~ T or \\d aa h\(a, (3).rj\\ > 7L _T ||?7|| for any 
77 G M n \ {0}. This technical definition is basically a quantitative transver- 
sality condition which is stated in adapted coordinates. It is inspired on the 
one hand by the steepness condition introduced by Nekhoroshev ( |Nek77| ) 
where one has to look at the projection of the gradient map V/i onto affine 
subspaces, and on the other hand by the quantitative Morse-Sard theory of 
Yomdin ( [Yom83j , [YC04J ) where critical or "nearly-critical" points of h have 
to be quantitatively non degenerate. It is in fact equivalent to the condition 
introduced by Niederman in [Nie07j : there the author considered the subset 
Gl{ti, k) of G(n, k) consisting of those subspaces which can be spanned by 
vectors k G Z™ \ {0} with \k\\ < L, but one can check that G L (n,k) is in- 
cluded in G L k(n, k) (similarly, G^(n, k) is included in G L> (n, k)). Hence we 
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will stick with the terminology "Diophantine Morse" introduced in [Nie07| . 
and the equivalent term "Simultaneous Diophantine Morse" introduced in 
[BNlOj will not be used any more. 

The set of Diophantine Morse functions on Br with respect to 7 > 
and r > will be denoted by DM^(Br), and we will also use the notations 

DM T {B R ) = (J DM; (B R ), DM (B r ) = [j DM T (Br). 

7>0 t>0 

We recall the following two results from [Nie07l (see also [BN10J). 

Theorem 2.2. Let r > 2(n 2 + 1) and h G C 2n+2 (B R ). Then for Lebesgue 
almost all £ G R n , the function h^(I) = h(I) - £J belongs to DM T (B R ). 

We already mentioned that there is a good notion of "full measure" in 
an infinite dimensional vector space, which is called prevalence (see |OY05| 
and |HK10j for nice surveys), and the previous theorem has the following 
immediate corollary. 

Corollary 2.3. For r > 2(n 2 + 1), DM T (B R ) is prevalent in C 2n+2 (B R ). 

2. Now let us introduce our regularity assumption, starting with the Gevrey 
case. Given a > 1 and L > 0, a real-valued function H G C°°(T)r) is (a, L)- 
Gevrey if, using the standard multi-index notation, we have 

\H\ G ^n)= E L^ a (l\r a \d l H\ c o [VR) <^ 

where | . \c°(p R ) 1S the usual supremum norm for functions on T>r. The space 
of such functions, with the above norm, is a Banach algebra that we denote 
by G a,L (T>R), and in the sequel we shall simply write | . \ ai L = \ ■ \G a - L (v R )- 
Analytic functions are a particular case of Gevrey functions, as one can check 
that G 1,l (Vr) is exactly the space of bounded real-analytic functions on Vr 
which extend as bounded holomorphic functions on the complex domain 

V l (Vr) = {(0,1) G (C n /Z n ) x C n I \1(6)\ < L, d(I,B R ) < L], 

where 1(0) is the imaginary part of 0, | . | the supremum norm on C n and d 
the associated distance on C n . 

3. Therefore we shall consider a Hamiltonian 

H(0,i) = h(i) + f(0,i), (0,1) eV R , 

\h\ a ,L = 1, \f\a,L < £■ 

Our main result in the Gevrey case is the following. 
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Theorem 2.4. Let H be as in fl*J^ and assume that the integrable part h 
belongs to DM^(B), with r > 2 and 7 < 1. Let us define 

a = b = 3- 1 (2(n + l)T)- n . 

Then there exists a constant eq > ; depending on n, R, L 3 a, 7 and t, such 
that if e < Sq, for every initial action 1(0) G -Br/ 2 following estimates 

\I(t) - /(0)| < (n + 1)V, |t| < exp(e- a " la ), 

hold true. 

For a = 1, we exactly recover the main theorem of [BN10J including the 
value of the exponents, therefore the latter result is generalized to the Gevrey 
classes. Moreover, quasi-convex Hamiltonians are a very particular case of 
our class of Morse Diophantine Hamiltonians, hence the stability result of 
[MS02] is also generalized, but not with the same exponents (we did not try 
to improve our exponents). Let us now explain several consequences of our 
result. 

First, note that the only property used on the integrable part h to derive 
these estimates is a specific steepness property, therefore the proof is also 
valid assuming a Diophantine steepness condition as in [N ie07] . which is 
much more general than the original steepness condition of Nekhoroshev. 
Indeed, the class of Diophantine Morse functions (and a fortiori the class of 
Diophantine steep functions) contains fairly degenerate Hamiltonians, as for 
instance linear Hamiltonians with a Diophantine frequency, which of course 
are far from being steep. As a direct consequence, our main theorem also 
gives an alternative proof of exponential stability in the neighbourhood of 
a Gevrey Lagrangian quasi-periodic invariant torus, a fact which was only 
recently proved by Mitev and Popov in |MP10j by the construction a Gevrey 
Birkhoff normal form. 

Then, as it was proved in [BoulOaj . the method we are using is rel- 
atively intrinsic and does not depend much on the choice of coordinates. 
This remark is particularly useful when studying the stability in the neigh- 
bourhood of an elliptic fixed point, and more generally in the neighbourhood 
of a linearly stable lower-dimensional torus, under the common assumptions 
of isotropicity and reducibility (which are automatic for a fixed point or a 
Lagrangian torus). As in [Bo ulOaj , one can easily prove results of exponen- 
tial stability in the Gevrey case under an appropriate Diophantine condition, 
therefore extending the results of exponential stability obtained in [BoulOa) 
which were valid in the analytic case. This also gives an extension of the 
stability result of [MP 10] which is only available for a Gevrey Lagrangian 
torus. 

In [BoulOa] . using the idea introduced by Morbidelli and Giorgilli ([MG95J) 
to combine Birkhoff normal forms and Nekhoroshev estimates, we also had 
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results of super-exponential stability under a Diophantine condition on the 
frequency and a prevalent condition on the formal series of Birkhoff in- 
variants. Using the Gevrey Birkhoff normal form of |MP10| . we can also 
extend this super-exponential stability result to Gevrey classes, but only for 
a Lagrangian torus. For a more general linearly stable torus (isotropic, re- 
ducible), the existence of a Gevrey Birkhoff normal form undoubtedly holds 
true but it is still missing. 

As a last remark, we would like to point out that one can also extend 
a fairly different result of stability, which is due to Berti, Bolle and Biasco 
([BBB03J). This concerns perturbations of a priori unstable Hamiltonians 
systems, which have intensely studied since instability properties in this 
context are much more simple to exhibit. In the analytic case, if the size 
of the perturbation is (a, it was proved in |BBB03j that the optimal time 
of instability is r([i) ~ ju -1 ln/i . The upper bound t(/x) < /i _1 ln/i _1 
follows from a specific construction of an unstable solution, while the lower 
bound t([i) > /u _1 ln^ _1 was a consequence of a stability result, where the 
analyticity of the system was only necessary to apply Nekhoroshev estimates 
both in the quasi-convex and steep case on certain regions of the phase 
space (but because of the presence of "hyperbolicity" , the global stability 
time is far from being exponentially large). In [BP10J, we introduced yet 
another technique (which pertains more to dynamical systems, as opposed 
to the variational arguments of [BBB03J) to construct a solution for which 
t(h) < (i hn.fi , but only the Gevrey case. Now having at our disposal 
Nekhoroshev estimates in the Gevrey case for both quasi-convex and steep 
integrable systems, this implies that the lower bound r(/x) > fi^ 1 In fC 1 can 
also be obtained and that the time of instability r(/i) ~ fi^hn/j, is also 
optimal in the Gevrey case (in fact, using Theorem 12.51 below, this is also 
true if the system is C k for k large enough). This justify the optimality we 
claimed in |BP10| . 

4. Let us now explain our result in the finitely differentiable case. Here 
we assume that H is of class C k , i.e. it is fc-times differentiable and all its 
derivatives up to order k extend continuously to the closure T)r. In order 
to have non-trivial results, we shall assume a minimal amount of regularity, 
that is k > n + 1 and it will convenient to introduce another parameter of 
regularity k* G N* satisfying k > k*n + 1. We denote by C k (T>n) the space 
of functions of class C k on T>r, which is a Banach algebra with the norm 

\ h \c(Pr) = "Y^WY^^CPtpR)* 
\l\<k 

where we have used the standard multi-index notation and where | . \ c°{v R ) 
still denotes the usual supremum norm for functions on T)r. Once again, 
for simplicity, we shall only write \ - \k = \ - \c k (v R )- 
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5. So now we consider a Hamiltonian of the form 

H(9,I) = h(I) + f(6,I), (9,1) eV R , 
|% = l,|/| fc <e. 

Our main result in the finitely differentiable case is the following. 

Theorem 2.5. Let H be as in it**]) , assume that the integrable part h belongs 
to DM^(B), with t > 2 and 7 < 1, and that k > k*n + 1 for some k* G N*. 
Let us define 

a = b = 3- 1 {2{n + l)T)- n . 

Then there exists a constant £0 > 0, depending on n, R, k, 7 and t, such that 
if e < £0; f or every initial action 1(0) £ B R / 2 the following estimates 

\I(t)-I(0)\<(n + l) 2 e b , |t|<£- fc * a , 

hold true. 

The above theorem extends the main result of [B oulObj . which was only 
valid for quasi-convex integrable Hamiltonians. Let us point out that in this 
result (as in the one contained in |Boul0b| ) we have decided to consider 
only the case of integer values of k, but the results can also be extended 
to real values (that is, to Holder spaces) and this would have given a more 
precise exponent of stability in terms of the regularity of the system, but we 
decided not to pursue this further. 

6. Let us now conclude with some notations that we shall use throughout 
the text. 

First, we have define norms for Gevrey and C k functions, but we shall 
need corresponding norms for vector- valued functions (in particular for dif- 
feomorphisms) . Hence given a vector- valued function F : D R — > M m , m £ N* 
and F = (Fi, . . . , F m ), we say that F is (a, L)-Gevrey if F { E G a > L (V R ), for 
1 < i < m, and we will write = YliLi \Fi\a,L- Similarly, F is of class 

C k if Fi G C k (V R ), for 1 < i < m,' and we will write \F\ k = YT=i \ F i\k- 

Then, to avoid cumbersome expressions, we will replace constants de- 
pending only on n, R, L, a, 7 and r (resp. on n, i?, k, 7 and r) in the Gevrey 
case (resp. in the C k case) with a dot. More precisely, an assertion of the 
form "there exists a constant c > 1 depending on the above parameters such 
that u < cv" will be simply replaced with "n <• v" , when the context is clear. 

3 Analytical part 

In this part, we shall describe and prove some normal forms that we will 
need for the proofs of Theorem [23] and Theorem [23J More precisely, Gevrey 
Hamiltonians will be considered in section 13.11 and finitely differentiable 
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Hamiltonians in section 13. 2\ and eventually in section 13.31 we will explain 
the dynamical consequences of these normal forms. 

But first we need to recall the following basic definition, which will be 
crucial to us. 

Definition 3.1. A vector to G M. n \ {0} is said to be periodic if there exists 
a real number t > such that tto G Z n . In this case, the number 



is called the period ofu. 

The easiest example is given by a vector with rational components, the 
period of which is just the least common multiple of the denominators of 
its components. Geometrically, if to is T-periodic, an invariant torus with a 
linear flow with vector to is filled with T-periodic orbits. 

3.1 The Gevrey case 

As in |MS02j . we shall start with perturbations of linear integrable Hamilto- 
nians, for which we will obtain a global normal form (Lemma l3.3l below) . and 
then the latter will be used to obtain local normal forms for perturbations 
of general Hamiltonians (Proposition 13.41 below) . 

1. Let ioi G M. n \ {0} be a T\ -periodic vector, and let h(I) = to\.I be the 
linear integrable Hamiltonian with frequency u)\. In the following, we shall 
consider a "large" positive integer m G N* and a "small" parameter fj,% > 0, 
which will eventually depend on e. We shall also use a real number p\ > 
independent of e, to be fixed below. The following result is due to Marco 
and Sauzin ( |MS02| ). 

Lemma 3.2 (Marco-Sauzin). Consider the Hamiltonian H = l\ + f defined 
on 2?3 Pl , with f G G a ' L (T>3 Pl ) and \ f\ a ,L < Ml* Assume that 



Then there exist L\ = CL, for some constant < C < 1, and an (a, L\)- 
Gevrey symplectic transformation 



T = mf{t > | tu G Z n } 



Tini ■< 1, mTi/ii •< 1. 



(1) 




with |$ x — Id| Qj L 1 <• T\ni such that 

H 1 = H o = l t + g 1 + / 
with {g 1 ,^} = and the estimates 



i 



9 \aM <Ml, |/ \a,Li 



<■ e 



Mi, 



hold true. 
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One can choose the constant C = 16~ 1 (2n) - a~~ . The statement above 
is exactly Proposition 3.2 in [MS02J, where the authors state their result 
for mT\[i\ ■ = 1, but it also holds trivially for smaller to, that is mT\[i\ •< 1. 
The use of this artificial parameter m S N* will make subsequent arguments 
easier. 

It is perhaps useful to understand this lemma in the very special case 
where u\ = e± is the first vector of the canonical basis of R n : the equality 
{g 1 ,^} = simply means that g 1 is independent of the first angle 6\, and 
therefore the evolution of the first action component I\ is only governed by 
the remainder / . 

2. Now we are going to make suitable "compositions" of the above lemma, 
but first we shall explain heuristically what we are planning to do formally 
in the sequel. 

So we consider another periodic vector 002 € R n \ {0}, with period T2 > 
0, which is independent of u±, and we let Z2CO = ^2-1- If the suitable 
hypotheses are met, by Lemma 13.21 we can transform H = l 2 + /, where the 
size of / is of order ^2, into 

H 2 = Ho<f> 2 = l 2 +g 2 + f 2 , {g 2 ,l 2 } = 0, 

with g 2 of order fi2 and f 2 of order e _m /i2 • Now if ll>2 is close enough to uj\ , 
that is \oj2 — k>i| < Ml) an d if A*2 < fJ>i, we can write 

H 2 = (l 2 + l l -l l ) + g 2 + f 2 = l 1 + (l 2 g 2 ) + f = k + f + f\ 

where / = 1% — l\ + g 2 satisfies {/, l 2 } and its size is of order fj,i. For a 
moment, let us forget about / 2 , which is already exponentially small with 
respect to to, and consider l\ +/ as a perturbation of l\. Under the suitable 
assumptions, we can apply once again Lemma [3.21 and find a transformation 
<I> 1 that sends l\ + f into l\ + g 1 + f 1 , where f 1 is exponentially small with 
respect to to and {g 1 , Zi} = 0. 

Now the key point is the following: as / satisfies {f,h} = 0, g 1 and f 1 
also satisfy {g 1 ,^} = {Z 1 ;^} = 0, hence {g ,h} = {g = 0. Indeed, 
it is enough to show that, denoting (^i 1 ) sgK the Hamiltonian flow of 1%, if 
{/,/ 2 } = 0then 

and 

Xx = yJ \f-[f\i)°^sds 

also satisfy {[/]i,^} = and {xij^} = 0. This can be easily proved 
by direct computations, but this is a general fact in normal form theory 
(sometimes known as a "normal form with symmetry") and a nicer way to 
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see this goes as follows. Since = 0, the linear operators L/ 1 = {.,/i} 

and L[ 2 = {., l 2 } commutes, so that the kernel of L/ 2 is invariant by L^, and 
as L\ x is semi-simple, the kernel of Li 2 is also invariant under the projection 
onto the kernel of which is given by the map [.]i. This explains why 
{[/]ij h} = 0. Now f — [f]i is in the kernel of Li 2 , and its unique pre- image 
by Li x is given by x\i hence {xi, h} = 0. Put it differently, if a Hamiltonian 
(above l\ + f) has an integral (in our case, l 2 ), then the integral is invariant 
under the normalizing transformation (in our case, l 2 3>i = Z2) and h 
remains an integral of the normalized Hamiltonian (that is g 1 + f l , ^2})- 
To conclude, taking into account f 2 , the map sends f 2 to / 2 o$ 1 which 
remains exponentially small with respect to m, and so is f2 = f lj rf 2 °^ 1 - 
Therefore 

H 2 o (D 1 = h + g 1 + f 2 , {g 1 ,1,} = {g\l 2 } = 0, 
then setting g 2 = h — h + 9 , we can write again 

H 2 o $! = l 2 + g 2 + f 2 , {g 2 , h} = {g 2 , l 2 } = 0. 
Finally, we have found $2 = ^ 2 ^ l such that 

H 2 = H o $ 2 = H 2 o $ X = H o $ 2 o 3. 1 = Z 2 + 5 2 + h 

with {#2^1} = {52^2} = and f 2 exponentially small. 

3. Now let us make our previous discussion rigorous. 

For i G {1, . . . , n}, let Ui G W 1 \ {0} be independent Tj-periodic vectors, 
and we denote by li the linear integrable Hamiltonian of frequency Wj. We 
consider a positive integer m G N* and a sequence of small parameters 
Hi > 0, for i G {1, . . . , n}. As before, m G N* and /Ui > 0, for i G {1, . . . , n}, 
will eventually depend on e. Now to fix the ideas, we define the increasing 
sequence 

Pi = 2\ i€{l,...,n}. 

We shall need some assumptions on these parameters, so we define the 
condition (Ai) for i G {1, ... ,n} by 

Ti/xi-<l, mTi/ii-<l (Ai) 

and for i G {2, . . . , n}, 

TiHi-<l, mTifii-<l, \ui - <-Mi_i 5 fii-<Hi-i- (Ai) 

Recalling the constant C > that appeared in Lemma 13.21 we shall also 
define the decreasing sequence Lj = C"L, for « G {0, . . . ,n}. In the above 
lemma, we shall use Lemma lA.ll of appendix [AI 
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Lemma 3.3. Let j G {1, . . . , n}, and consider the Hamiltonian H = lj + f 
defined on £>3p», with f G G a,L (T>^ Pj ) and \f\ a .L <■ Mj- Assume that (Ai) is 
satisfied for i £ {1,. . . , j}. T/ien £/iere exists a symplectic transformation 

iu/iere <1> 1 : P 2 p, -> *s (a,Lj_ i+1 )-Gevrey and |$ l - Id| Q , i L j _ i+1 <-7i/Ui 
for i G {1, ... , j}, suc/i f/mf 

Hj = Ho cD, = + 9j + g G 01 '^ (V 2pi ) 

TOi/i {gj, Zj} = 0, for i G {1, . . . , j} ; and t/ie estimates 

II I J? I 77l^~^° 1 ' 

\9j\a,Lj < Ml) |7j"|a,Lj<e /ii, 

Proof. The proof goes by induction. For j = 1, this is nothing but Lemma f3. 2 1 
with gi = g 1 and fi = f 1 - Now assume that the result holds true for some 
j — 1 G {1, . . . , n — 1}, and let us show that it remains true for j G {2, . . . , n}. 
By assumption, (Aj) is satisfied, in particular 

TjfjLj ■< 1, mTjfij •< 1 

hence condition ([1]) of Lemma 13.21 holds true. Therefore, there exists an 
(a, Li)-Gevrey symplectic transformation 

with |<E> J — Idl^ij <-Tjfij such that 

H j = Ho & = lj + + /J 
with {g J , } = and the estimates 

W\a,L x <Hj, \f 3 \aM <e~ ml "[lj, 

hold true. 

Now let us introduce / = lj — lj-\ + gK Obviously, we have {/, lj} = 0. 
Moreover, by assumption (Aj) we have \u)j — u)j—i\ <• l±j-\ and fj,j ■< fJ,j-i so 
that 

|/U,Li < \lj — lj-l\a,Li + l^ 7 |ck,-Li 
<• |wj — + fXj 

Then we can write 

//' . • / • f J - 
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Furthermore, as T>^ Pjl C T>2 Pj , the Hamiltonian Zj_i + / is well-defined on 
V Zpj _^ we have / G G Q ' Ll (2? 3ft _J with |/| a , Ll < /ij -_ 1 . 

Now recall that (Aj) holds true for i G {l,...,j — 1}, hence we can 
eventually apply our hypothesis of induction to the Hamiltonian Zj_i + /: 
there exists a symplectic transformation 

= o • • • o $! : P 2pi -> D 3pj ._ 1 

where <£> l : P 2ft -> ^3 ft is (a, Lj_ i+ i)-Gevrey and - Id| aiLj _ i+1 <T i /i i 
for z G {1, ... ,j — 1}, such that 

(Z,-_i + /) o = + ^ + /_,-_! G G a ' L ^(V 2pi ) 

with /j} = 0, for i G {1, ... ,j — 1}, and the estimates 

hold true. Moreover, as {/, lj} = 0, we also have {fl^-i, lj} = and therefore 
= 0, for iG{l,... , j}. 
Then we set $j = & o so that 



Now 



#7 = Ho^ 



= H 3 O 

= (Z,_! + /) o + /J o 

= Zj_i +/,-_! +f j O<f> j _ 1 . 

We will prove below that f 3 o G G a '^ (P 2pi ), and since we know that 
the function Zj_i + gj_i + G G a ' L ^- 1 (T>2 Pl ), this will easily implies 

that Hj G G a,L] {T>2 Pl ). Now let us define </j = Zj_i — lj + and fj = 
fj-i + f 3 so that we can eventually write 

Hj = lj+gj + fj. 

Since {gj-i, h} = then obviously {gj,h} = 0, for i G {1, . . . ,j}. Therefore 
it remains to prove the estimates. First, we have 

\9j\a,Lj < \lj-l — lj\a,Lj + \9j-l\a,Lj 

< |Zj_i - lj\ a ,Lj + bi-lU.Lj-i 

<• 1^7-1 — Wjl + /Ltl 

<• [Xj-! + //i 

<• Ml- 
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Then, we know that \fj~i\ a ,L j ^ 1 <e ml "pti, so we only need to estimate 
\p o &j-i\ a ,Lj- For that, recall that 

$ 3 _ x = (J)- 5-1 o • • • o (J) 1 

and for i G {1, . . . , j — 1}, we have the estimates 

l^i-Idla^+i <TifM<l. 

Therefore a repeated use of lemma [A . II yields 



\f j o $j-l\ a , Lj = 


\f j 


O O • ■ 


• • o <J> X 


\a,Lj 


< 


\f 


o ^-J -1 o • • 


• o $ 2 


\a,Lj 


< 


\f 


o (I?-? -1 o • 


• • o $ 3 


\a,Lj 


< 


\f 


\a,Li 
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Hence 

\fj\a,L :) < \fj-l\a,Lj + \f ° ^i-lU.Lj 

< l/j-lki,-! +|/ J 'o*j_i| a ,2 /j 

<• e- ml/Q (/ii 

_ 7TJ l/a 

<• e m /xi, 

which is the required estimate. □ 

Here also, it is perhaps useful to understand this lemma in the special 
case where (u>i, . . . , uj n ) is the canonical basis of W 1 : the equality {gj,h} = 
for i G {1, . . . , j} means that gj is independent of the first j angles 9±,. . . ,9j, 
and therefore the evolution of the first j action components I±, . . . ,Ij is only 
governed by the remainder f 3 . In any cases, since we are assuming that 
(uji, . . . ,u> n ) are linearly independent, then for j = n, g n is integrable and 
the action variables can only evolve according to f n . 

4. Now we shall come back to our original setting (j*J), that is 
(H(9,I) = h(I) + f(e,I), (6,1) eV R , 

\\h\a,L = 1, \f\a,L < e. 

For i € {1, ... , n}, we still consider a sequence of Tj-periodic vectors Ui, a 
sequence of small parameters Hi and an integer m EN*. 

Let us fix i G {1, . . . ,n}. If we were able to find a Tj-periodic action 
Ii G Br linked to cjj, that is satisfying V/i(/j) = tOj, then on a small ball 
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of radius around we could perform some standard scalings to reduce 
the study of perturbations of h to the study of perturbations of the linear 
Hamiltonian = 0Ji-I, and so we could use the results of the previous 
section. However, in the sequel we will construct Ui, but since we are not 
assuming that the gradient map of h is invertible, we cannot construct a 
corresponding action. In fact, this is not a serious problem, but this is 
just meant to explain why we need to use some slightly twisted arguments 
below. In [BN10J, we used the idea, coming from [NieOTj, to define domains 
directly in the space of frequencies, however this lead to a rather cumbersome 
definition of domains. Here we shall use a simpler approach that will enable 
us to work in the space of actions. 

For i € {1, . . . ,n}, we consider a sequence of actions Ii which is ^j-linked 
to the sequence of independent periodic vectors Wj, in the sense that 



By the construction of our periodic vectors, such actions will indeed exist. 

Taking into account this sequence of actions (Ii, . . . , I n ) and the size of 
the perturbation e, we define some new assumptions (Bi), for i 6 {1, . . . , n}, 



lTj/Xj-<l, mTi^i-<l, \u>i </ii_i, fii ■< m-x, 

\fj,i ■< 1, e < /if, \Vh(Ii) - Ui\ < 

In the proposition below, we shall denote by : Vr — > Br the pro- 
jection onto the action space, and we shall make use of Lemma IA.2I in 
appendix [A] 

Proposition 3.4. Suppose H is as in d*jj, and assume that {Bi) is satisfied 
for i G {1, . . . , j}. Then there exists a C°° symplectic transformation 



\Vh(Ii) - oji\ < Hi. 



by 



TxHx ■< 1, mTxHi < 1, 

Hx < 1) e < Mi> |V/i(/i) - ux\ < A*i 



and for i £ {2, ... , n}, 



*j : T n x B(I j ;2p 1 fj, j ) -»• T n x £(/,-, 3^^) 



with - Id/bo (jB(/ji2/9lMj )) •< Mi suc/l 



ffotj = h + gj +fj, 



with {gj, li} = /or i € {1, . . . , j} and the estimate 



^6»/ilc* (T«xB(/ 3 ,2p lMj )) < e 




/joWs trite. 
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Note that the proof gives in fact slightly better estimates, and also an 
estimate on the size of the function gj, but this will not be needed in the 
following. The case j = 1 is due to Marco and Sauzin (|MSQ2j), and together 
with an estimate on g~ this case is sufficient to prove effective stability for 
quasi-convex unperturbed systems. But here we shall need this result for 
any j £ {1, . . . , n}: indeed, since we are assuming the periodic vectors to be 
independent, the above proposition will give us information on resonances 
of any multiplicities (the case j S {1, . . . ,n} corresponds to a resonance of 
multiplicity n — j). 

Proof. To analyze our Hamiltonian H in a neighbourhood of size fj,j around 
Ij, we translate and rescale the action variables using the conformally sym- 
plectic map 

aj : (0,1)^(0, 1) = (9,Ij + fijI) 

which sends the domain T>^ p . = T n x B% p . onto T n x B(Ij,3pj/j,j). By the 
condition /ij <• 1 in (Bj), we can assume that the latter domain is included 
in Vr. Let 

H = fij 1 (Hoa J ) 
be the rescaled Hamiltonian, so H is defined on T>$ Pi and reads 

H(9, I) = iiJ l H(9, I, + pji) = nfhtfj + iifl) + ^f(9, Ij + H I) 

for (9,1) G ^3p»- Now using Taylor's formula we can expand h around Ij 
and, assuming with no loss of generality that h(Ij) = 0, we obtain 

h(Ij + /ijl) = fijVh{Ij).I + $ j (1 - t)V 2 h(Ij + tnjl)l.ldt 

Jo 

= fJ>jWj.I + fij(Vh(Ij) — ujj).I 

+ n 2 [ (l-t)V 2 h{Ij+tHjI)I.Idt 
Jo 

= [ijUj.I + fJ-jh(I) 
where we have defined 

h(I) = (Vh(Ij) - ujj).I + ft [ (1 - t)V 2 h{Ij + tpLjI)I.Idt. 

Jo 

Therefore we can write 

H = l j + j 

with 

f = ~h + iiJ l {foo-j). 
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Let us estimate the norm of /. Setting L = L/2, by using Lemma [A, 2 1 (with 
p = 2) we can bound the (a, L)-norm of V 2 h in terms of the (a, L)-norm of 
h. Now recalling that |/i| a ,L = 1, we obtain 

\V 2 h\ arL <l, 

and as |V/i(7j) — < \ij (this is part of assumption (Bj)), we have 

\h\ a> L<H- 
Then, by (Bj) again, e < \i ? hence 

Im/H/ ° a j)\a,L < H] l S < flj. 

and therefore 

\f\a,L ^ \h\a,L + K 7l (/ ° a j)\<x,L 
As (Bi) implies (Ai) for i G {1, . . . we can eventually apply Lemma f3T31 
to the Hamiltonian H = lj + f (replacing L by L and Lj by Lj): there exists 
a symplectic transformation 

$ 3 - = $J o • • • o I) 1 : P 2pi -> D 3p . 

where <l l : £> 2pi -> £>3 W is (a, %_ i+1 )-Gevrey and |$ l - Id| a ^_ i+i <-2>i 
for i G {1, . . . , j}, such that 

Hj = Ho % = lj + ~ 9j + fj G G a ^ (V 2pi ) 
with {gj, l{} = 0, for i G {1, . . . , j}, and the estimates 

wL,£ j <-Mi ; \f3\ a ;L j < e mi, 

hold true. Moreover, if we introduce 

= ^ - fc, 

we still have {Sj, lj} = 0, for i G {1, . . . , j}, and so the transformed Hamil- 
tonian can also be written as 

Hj = Ho $j = lj + h + sj + fj. 

Now scaling back to our original coordinates, we define ^fj = Oj o &j o crj 1 , 
therefore 

*j : T n x B(Ij,2 Pl fij) — ► T n x B(Ij,3p jPj ) 

and 

H otyj = fijH o $j o aj 1 

= + h + Sj + fj) o aj l 

= fj,j(lj + h)o aj l + /XjSj o aj 1 + /Xj/j o aj l . 
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Observe that + h)o(r- 1 = h, so we may set 

g j = Hj §j oaj 1 , fj = Hj fj oaj 1 , 

and write 

HoVj = h + g j + f j . 
It is clear that {gj, li} = 0, for i G {1, ... , j}, and as dgfj = pjdefj, then 

\ c, 0Jj\c°(j n xB(i j ,2p lf i.j)) <" e Mi Mi < e Mi- 

Finally, since 

with |** - Id| a) ^_. +1 <• Ti/ii for % G {1, . . . , j}, then 

|n/$j - Id/lc-op^j <• ^ max {Ti|*i} 

hence 

- U I \ c o^ nxB(I:ji 2p lH )) < Mi . max {Tim} •< 

where the last estimate follows from the fact that Tm <• 1 for i G {1, . . . , j} 
with a suitable implicit constant. This ends the proof. □ 

3.2 The C*-case 

Now let us explain how one can obtain similar normal forms for finitely 
differentiable Hamiltonians, with of course only a polynomial bound on the 
remainder. Here also, as in [BoulObj . we shall first construct a global normal 
form for perturbations of linear integrable Hamiltonians (Lemma 13. 6p and 
then recover local normal forms for perturbations of general Hamiltonians 
(Proposition I3.7j) . 

5. We use the same notations as in the previous section, that is uj\ G M n \{0} 
is a Ti-periodic vector, h(I) = and we have parameters m G N* and 
Hi > 0, while p\ is already fixed. Given k* G N*, recall that we are assuming 
k > k*n + 1. The following result is due to the author ([BoulObj). 

Lemma 3.5. Consider the Hamiltonian H = l\ + / defined on T>% px , with 
f G C k (T>s Pl ) and \f\^ < fi\. Assume that 

Tihi ■< 1, mTxHi'<l. 

Then there exists a C k ~ k symplectic transformation 

S 1 : V 2pi -> V 3pi 
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with |$ x — Id\k-k* <T\Hi such that 

H 1 = H o fc 1 = h + g 1 + f 1 
with {g 1 ,^} = and the estimates 

\g 1 \k-k*<Vii l^lfe-fe* <m~ k *ni, 

hold true. 

The statement above is exactly Proposition 3.2 in [BoulObj . where it is 
stated for mTi/ii • = 1 and k* = k — 2, but of course it also holds for smaller 
m, that is mT\^\ •< 1, and for any integer k* G N* smaller than k. 

6. Now as in the previous section, performing suitable "compositions" of 
the above lemma, this readily gives us the following statement. 

Lemma 3.6. Let j G {1, . . . , n}, and consider the Hamiltonian H = lj + f 
defined on £>3 Pj , with f G C fc (2?3 Pj ) and |/|& <■/«?■ Assume that (vLJ is 
satisfied for i £ {1, . . . , j} . Then there exists a symplectic transformation 

^ = ^0-0^: V 2pi V 3f)j 

where : V 2f>l -> V Zpi is C k ~ kH and | — Id | <Tm fori G {1, . . . 
such that 

Hj =Ho = l 3 + 9j + /, G C k - kH {V 2(n ) 
with {gj, li} = 0, for i G {1, . . . , j}, and the estimates 

\gj\k-k*j < Ml) \fj\k-k*j < m ~ k Ml) 

hold true. 

The proof is completely identical to the proof of Lemma 13.61 (using 
Lemma IA.3I instead of Lemma lA.ip . hence we do not repeat the details. 

7. Finally we come back to the original setting, that is 

H(6,I) = h(I) + f(9,I), (9,1) eV R , 
\h\ k = 1, \f\k < e, 
for which we have the following proposition. 

Proposition 3.7. Suppose H is as in it**]) , and assume that (Bi) is satisfied 
for i G {1, . . . Then there exists a C k ~ k *i symplectic transformation 

tyj : T n x B(I j ,2p 1 fi j ) -> T n x B(Ij,^ Pj Hj) 

with - Id7| c o( B (/ 3 . j2pi ^)) •< fij such that 

HoVj = h + g j +f j , 

with {gj, li} = for i G {1, . . . , j} and the estimate 

|d0/ilc°(T«xB(/j,2piAtj)) < m~ k [ij 

holds true. 
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Once again, the proof is completely analogous to the proof of Proposi- 
tion 13.41 (using Lemma IA.4I instead of Lemma IA.2|) , hence there is no need 
to give further details. 

Let us notice that our last estimate gives in fact 

\fj\k-k*j < m k (J.j, 

but since k — k*j > 1 for any j £ {1, . . . , n}, this yields in particular the 
estimate stated in the above proposition. 

Here the case j = 1 is due to author ([BoulOb]) and is enough to prove 
effective stability for quasi-convex unperturbed systems, but as we already 
explained, we shall need this result for any j E {1, . . . , n}. 

3.3 Dynamical consequences 

Let us now examine the dynamical consequences of our normal forms. As 
usual, it will be used to control the directions, if any, in which the action 
variables in these new coordinates can actually drift. Below we shall state 
a result in the coordinates given by our normal forms, and we shall come 
back to our original coordinates at the beginning of the next section. 

In order to treat the Gevrey case and the C k case in a unified way, we 
introduce yet another parameter r m > which will eventually gives us the 
time of stability. For a-Gevrey Hamiltonians, we set 

T _ P ™ 1/a 

1 m — c ! 

and for C fc -Hamiltonians, with k > k*n + 1 for some k* € N*, this will be 



Under the assumptions of Proposition 13.41 or Proposition 13.71 consider the 
Hamiltonian 

Hj = H otyj = h + gj + fj 

defined on the domain T n x B(Ij, 4//j) (from now on we shall use the fact 
that we have defined pi = 2). Let Mj be the Z-module 

Mj = {k£Z n \ k.Ui = 0, i G {1, . . . , j}}. 

Since the periodic vectors are independent, its rank is n — j, and it is also 
the dimension of the vector space Aj = Aij (8) R spanned by M j . 

We shall need the following lemma, which is completely obvious using 
the definition of the Poisson bracket. 

Lemma 3.8. The equality {gj,h} = 0, for all i G {1, . . . ,j}, is equivalent 
to d e gj{e,I) € Aj, for {9,1) e T n x 5(^,4^). 



21 



Now consider a solution (9 J (t) , P (t)) of the Hamiltonian Hj starting at 
P(tj) G B(Ij,Afij) for some tj G R, and define the time of escape of this 
solution as the smallest time tj G]tj,+oo] for which P (tj) ^ B(Ij,4/ij). 

The only information we shall use from our normal forms is contained in 
the next proposition, where we shall denote by IL, the projection onto the 
linear subspace Aj. 

Proposition 3.9. Let Hj = h + Qj + fj be a Hamiltonian defined on the 
domain T n x B(Ij,4fij), with {gj,h} = for i G {1, . . . ,j}, and such that 
the estimate 

\defj\c (T n xB(I 3 ,4H)) < T mVj 

hold true. Then, with the previous notations, we have 

\P(t) - P(tj) - n,(F(t) - P(tj))\ < t e [tj,r m ] n [tjM 

In particular, 

\r(t)-r(t n )\<fi n , te[t n ,r m ]. 

Proof. Let 11^ be the projection onto the orthogonal complement of Aj, so 
that IX,- + nj- is the identity and therefore 

\P(t) - P(t 3 ) - Uj(P(t) - P{tj))\ = \Uf(P(t) - P(t 3 ))\. 

Now, as long as t < tj, the equations of motion for Hj = h + gj + fj and the 
mean value theorem give 

\P(t) - P(tj)\ <\t- tj\\d e (gj + fj)\ c o(T^B(i 3 A H ))- 

But {gj,k} = for i 6 {1, . . . , j}, so by Lemma I3T81 we have dggj(9, 1) £ Aj 
for any (0,1) G T n x B(Ij, 4/ij), hence if we first project the equations onto 
the orthogonal complement of Aj we have 

\Uf[P{t) - P(tj))\ <\t- tj\\d e f 3 \ c0{TnxmAH)) . 

Now since \t - tj\ < r m and \d e fj\co{Y™xB(i J ,2p lN )) < vVj. the previous 
estimate gives 

I uf (P (t) - P (tj )) I < n 3 , t g [tj, r m [n [tj , tj [, 

and therefore 

\P(t) - P(tj) - Uj(P(t) - P(tj))\ < H , t G [tj, r m [n[tj, t 3 [. 

Finally, for j = n, note that g n is integrable (since A n ) and IT n is identi- 
cally zero, so that the mean value theorem immediately gives t n > r m and 
the estimate 

\I n (t) -I n (t n )\ < Mn, t£[t n ,T m ], 

follows easily. This concludes the proof. □ 
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The interpretation of the above proposition is the following: if Xj is the 
affine subspace passing through I 3 (tj) with direction space Aj, then as long 
as P(t) remains in the domain of definition, it is /ij-close to Xj during an 
interval of time of length r m . This means that for that interval of time, there 
is almost no variation of the action components in the direction transversal 
to Xj, so that any potential drift has to occur along that space. 

If we were assuming that the energy sub-levels of the integrable Hamil- 
tonian are convex, then some direct arguments using the preservation of 
energy would give us a complete stability result for such solutions. But in 
our more general situation, we will have to use indirect and more compli- 
cated geometric arguments. 

4 Geometric part 

In this section, we shall describe some geometric arguments, first introduced 
by Niederman ([Nic04j, see also [BN10J for a somehow clearer exposition), 
that will lead to the proof of both Theorem 12.41 and Theorem 12.51 Without 
loss of generality, we will consider only solutions (0(t),I(t)) starting at time 
to = and evolving in positive time t > 0. In section 14. 1\ we will introduce 
a class of solutions, which we call "restrained", and for which the stability 
of the action variables is easily proved. Then, in section 14.21 we introduce 
the notion of "drifting" solutions, which by definition do not satisfy the 
stability properties implied by restrained solutions. We will then show that 
drifting solutions can in fact be restrained provided some assumptions are 
required, hence leading to the non-existence of such drifting solutions. This 
will eventually give us a proof of Theorem [23] and Theorem l2.5l in section [4^31 
As these geometric arguments are not affected at all by the regularity 
of the system, they will be similar for Gevrey or C k Hamiltonians. Of 
course, they are also the same for analytic Hamiltonians which were studied 
in |BN10]. therefore we shall merely state and explain the relevant results, 
for which detailed proofs are available in [BN10J. 

4.1 Restrained solutions 

1. In order to define our restrained solutions, we shall need some notations. 
Recall from Proposition 13.41 that if the suitable assumptions are met, we can 
define transformations 

: T n x B(Ij,Anj) -> T n x B(I j} Spj^j) 

with iri/^j — Id/|cro(_B(/ 4 w jj •< fJ-j, for j £ {1, . . . , n}. In particular, we can 
easily assume that the the image of ^fj contains the domain T n x B(Ij, 2fj,j). 
From now on, we shall write 

Bj = B{Ij,2nj), je {!,..., n}, 
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and for completeness we set Bo = Br. 

Now consider a solution (6(t),I(t)) G T n x Bq of our Hamiltonian H, 
starting at time to = 0. If at some time t\ > we can find a periodic vector 
wi G R n \ {0} such that 

|V/i(/(ti)) -wi| < Mi, 

then setting Ii = I(t\), (0(ti), 1(h)) G T n x f?i. Therefore, if assumption 
(Bi) is satisfied, we can define a normalized solution for t>t\, 

by 

M/ 1 (0 1 (t),/ 1 (t)) = (^),/W), 

as long as J(£) G B\. Now we can start again, but this time with the solution 
(0 x (i), J 1 ^)) G T n x Si of the Hamiltonian Hi = ii o * a starting at time 
ti: if at some time t 2 > *i we can find a periodic vector ^2 G M n \ {0}, 
independent of u)\, such that 

IV/l^ 1 ^)) -W 2 | < /J 2 , 

then setting I 2 = I 2 (h), (6 1 (t 2 ) , I 1 (i 2 )) G TP x S 2 , and provided (B 2 ) holds, 
we can define yet another normalized solution (9 2 (t) , I 2 (t)) , for i > i 2 , by 

Mo 2 (t),i 2 (t)) = (eHt),iHt)), 

as long as G i3 2 . 

Inductively, setting (0° (t) , 1° (t)) = (6(t),I(t)), for j G {1, . . . ,n} we can 
define the averaged solution (Q 3 (t) , P (t)) , for t>tj, by 

* j (9j(t),P(t)) = (0i-\t) ) P- 1 (t)), 

as long as P (t) G Sj, provided we have found independent periodic vectors 
ui, . . . ,ujj such that 

\X7h(Ij) — Uj\ < fij, 

with Ij = P~ x (tj) and assuming (Bj) is satisfied. 
Moreover, using our estimate on tyj we have 

\P(t)-P-\t)\-< H , je{l,...,n}, 

during that time interval. 

2. We can eventually write our definition. 

Definition 4.1. Given /xq > and m G N*, a solution (8(t),I(t)) of the 
Hamiltonian $*\) or it**]) , starting at time to = 0, is said to be restrained (by 
/io, up to time T m ) if we can find sequences of: 

(1) radii (^i, . . . ,[i n ), with < /% < • • • < /xi < /Xo," 
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(2) independent periodic vectors (oo\, . . . ,uj n ), with periods (T\, . . . ,T n ); 

(3) times (t±, . . . , t n ), with = to < h < ■ ■ ■ < t n < t n+ \ = r m , 

satisfying, for j £ {0, ... ,n— 1}, conditions (Bj + \) and the following con- 
ditions (Cj) defined by 

UP(t)-P(tj)\ < / x i , te[t j ,t j+1 ], 
\\Vh{P(t j+1 ))- Uj+1 \< H+l . J> 

Let us notice that for j G {0, ... ,n — 2}, setting = P(tj + i) the 
second condition of (Cj) gives part of condition (Bj + \), and also it ensures 
that the first condition of (Cj+i) is indeed well-defined. Also, our assump- 
tions imply the inclusion of domains C Bj for j E {1, . . . , n — 1} (which 
was the assumption made in [BN10J): indeed, one can choose an implicit 
constant in (Bj + \) so that 2/ij + i < fij, so if J € Bj+i, then 

< \I-I J+l \ + \P(t ]+l )-p- l (t 3 )\ 

< 2fij + i + fij < 2fij. 

3. The terminology "restrained" was introduced in [BN10J because for such 
solutions, the actions I(t) (or some properly normalized actions P(t)) are 
forced to pass close to a resonance at the time t = tj, the multiplicity of 
which decreases as j increases (since the periodic vectors are assumed to 
be independent), and moreover the variation of these (normalized) actions 
is controlled on each time interval [tj, ij+i]. Hence after the time t n , the 
actions are in a domain free of resonances and are easily confined in view of 
the last part of Proposition [331 This is the content of the above proposition. 

Proposition 4.2. Consider a restrained solution (6(t),I(t)), with an initial 
action 1(0) € B R / 2 - If Ho ■< 1, then the estimates 

\I(t) - I(0)| < (n+l) 2 /io, 0<t<r m , 

hold true. 

This is exactly Proposition 3.7 in [BN10J, to which we refer for the easy 
proof. 

4.2 Drifting solutions 

Restrained solutions are stable for an exponentially long interval of time 
with respect to m, and now we will show that this is in fact true for all 
solutions. 

4. The following definition will be useful in the sequel. 
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Definition 4.3. Given (1q > and m G N* ; a solution (9(t),I(t)) of the 
Hamiltonian or it**]), starting at time to = 0, is said to be drifting (to 
fio, before time T m ) if there exists a time i* satisfying 

- /(0)| = (n + l) 2 /xo, 0<t*<r m . 

Of course, this definition makes sense only if (n + l) 2 //0 < -R/2. In view 
of Proposition 14.21 drifting solutions cannot be restrained. However, we will 
prove below that if such a drifting solution exists, it has to be restrained 
under some assumptions on no, m and e, which will eventually prove that 
drifting solutions do not exist so that all solutions are indeed exponentially 
stable. 

More precisely, assuming the existence of a drifting solution, we will 
construct a sequence of radii (^i, . . . ,n n ), an increasing sequence of times 
(ii, . . . , t n ) and a sequence of linearly independent vectors {uj\, . . . , w n ), with 
periods (Tj, . . . ,T n ) satisfying, for j G {0, . . . ,n — 1}, assumptions (-Bj+i) 
and (Cj). All sequences will be built inductively, and we first describe two 
lemmas that we shall need. 

5. For j G {1, . . . , n}, recall that Aj is the vector space spanned by 

Mj = {k£Z n \ k.ui = 0, i G {1, . . . , j}}, 

and that IX,- (resp. Ilj-) is the projection onto Aj (resp. Aj-). Let us define 
the integer 

L j= sup {|T^|}GN*, j€{l,...,n-l}. 

For completeness, we set Ao = W 1 , Lq = 1 and in this case IIo is nothing 
but the identity. 

The first lemma will allow us to construct the sequence of times, and 
for that we will rely on the fact that our integrable part h belongs to 
DM^(B), so that it satisfies the following steepness property (see Lemma 
3.9 in [BNlOp . 

Lemma 4.4. For j G {0, . . . ,n — 1}, let Xj be any affine subspace with 
direction Aj, and take Cj < 1. Then for any continuous curve Tj : [tj,t|] — > 
Xj n Br with length 

\r j (t*)-T j (t j )\=c j -< 1 LJ^ 
there exists a time tj+\ G such that 

Tj(t) - Tj(tj)\ < cj, t G [tj,t j+1 ], 

u J (VhF j (t j+1 )))\ ->c|. 
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Now to construct the sequence of periodic vectors, we shall use the fol- 
lowing lemma, which is a straightforward application of Dirichlet's theorem 
on simultaneous Diophantine approximation (see Lemma 3.10 in [BN10J). 

Lemma 4.5. Given any vector v E W l and any real number Q > 0, there 
exists a T -"periodic vector u) satisfying 

\v-u\ < r _1 Q - ^, \v\~ x <T <Q\v\~ x . 

6. Now we have the necessary tools to show that drifting solutions cannot 
exist, provided suitable assumptions on the parameters are demanded. This 
will be done inductively, and for technical reasons we separate the first step 
(Proposition 14.6( 1 from the general inductive step (Proposition I4.7| ) . 

Proposition 4.6. Let (0(t),I(t)) be a drifting solution. If fiQ-<~f, then 
there exist a time t%, a T\-periodic vector oj\ and = ■ T^ 1 e ai for some 
positive constant a\, satisfying 

(a) \i(t)-i(o)\ </i , te[o,h); 

(b) |Vfc(J(ti))-wi| <mi- 
Moreover, we have the estimate 

l<T 1 <s- a ^ n - 1 \o 2 , l<Li<e" ai(n_1 W 2 - 

The proof is completely analogous to the proof of Proposition 3.11 in 
[BN10]: it uses the fact that our solution is drifting, Lemma 14.41 applied to 
the curve To(t) = l(t) with Co = /io> and Lemma [431 

Proposition 4.7. Let (9(t),I(t)) be a drifting solution, j G {1, ...,n — 

1} and assume that there exist sequences (t\, . . . ,tj), (oji, . . . , ujj) linearly 
independent and . . . , fij), satisfying assumptions (Bi) and (Cj_i), for 
i G {1, . . . , j}. Assume also that 

(t) (TjUjLj 1 ^ -<n j; 

(a) (T jH Lj i y •< 7 ^7 r ; 

(Hi) iii •<//§■ 

Then there exist a time ij+i, a Tj + i-periodic vector u j + \ and fij+i =■ T^ l e a i + ' 1 
for some positive constant dj+i, satisfying 

(a) \P(t)-P(tj)\<^, te[tj,t j+ x]; 

(b) \Vh(P(t j+1 ))-u j+ i\ < n j+1 ; 
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(c) -u)j\ <Hj. 

Moreover, we have the estimates 

KT j+1 <e- a ^ n -^^ 2 , 1<L J+1 < max {e^-V}^ 2 , 

ie{i,...,j+l} 

and if 

( -i\ 2r 
(iv) (J,j+i-< [TjH j L j j , 

then u)j+i is linearly independent of (u)\, . . . ,u)j). 

Once again, the proof is completely similar to the proof of Proposition 
3.12 in |BN10| . As before, it uses the fact that our solution is drifting, 
Lemma S31 applied to the curve Tj(t) = P (tj) + Uj(P(t) - P(tj)) with 

Cj = (^TjUjLj 1 ^ , Lemma H31 and Proposition 13.91 

4.3 Proof of Theorem 12.41 and Theorem 12.51 

We can finally prove our theorems, and once again we refer to the proof of 
Theorem 2.4 in [BN10J for some more details. 

Proof of Theorem \2.4\ and Theorem \2.5[ As a consequence of Propositions l4T2l 
14.61 and a repeated use of Proposition 14.71 we know that 

\I(t) -/(0)| < (n + l)Vo, 0<t<r m 

provided that the parameters fiQ, m and e satisfy the following eleven con- 
ditions: 

CO Mj+i •< (TjVjLj 1 ) , i e {1, . . . , n - 1}; 

(ii) ^TjfijLJ 1 ^ •< fij, for j G {1, . . . , n - 1}; 
(Hi) mTjfij ■< 1, for j £ {1, . . . , n}; 

(iv) m-<i4', 

(v) e < (j 2 , for j G {l,...,n}; 

(vi) (TjfijLfY ■< 7 Lj T , for j 6 {1, . . . , n - 1}; 

(vii) TjHj •< 1, for j G {1, . . . , n}; 
(viii) fij •< 1, for j G {1, . . . , n}; 
(iac) Hj ■< Hj-u for j G {2, . . . , n}; 
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(x) p -<r, 

(xi) fio •< 1; 

where fj,j =■ T~ x e a i , with a,j to be chosen for j G {1, . . . , n}, and 

K-T^-e-^- 1 )/^ 2 , 1<Lj<- max {e^"" 1 )}^ 2 . 

»e{i,...,j} 

So let us choose m- = s~ a and ro = e b , for two positive constants a and b. 
One can check (this is done in details in [BN10]) that all these conditions 
hold true if we choose 

o i = (2r(n + l))- n - 1 ^' J j€{l,...,n} 

and 

a = 6 = 3 _1 (2r(n + l)) - ", 

provided e < Eq, with a sufficiently small £o depending on n, R, a, L, E, M, 7 
and r (resp. on n, i?, fc, M, 7 and r) in the Gevrey case (resp. in the C k 
case). Now recalling that for a-Gevrey Hamiltonians, 

T - p ™ 1/o 

and for C fc -Hamiltonians, with k > k*n + 1 for k* G N*, 

T m = m k * , 

this completes the proof of both theorems. □ 

A Technical estimates 

In this short appendix, we give some technical estimates concerning Gevrey 
and finitely differentiable functions that we used in section [3J 

1. First in the proof of Lemma 13.31 we used the following estimate con- 
cerning the composition of Gevrey functions. 

Lemma A.l. Let < p' < p and V = CL. Suppose that g G G a ' L (D p ) 
and that $ : T> p / — > T> p is (a, L')-Gevrey. If 

|$ -ld\ atL > <1, 

then / o$ G G a ' L '(V p >) and 

|/°$|a,L' < \ f\a,L- 
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This lemma is contained in the statement of Corollary A.l, Appendix 
A. 2, in [MS02J, to which we refer for a proof and a possible choice of implicit 
constant. 

In fact, we could have used a more elaborated statement concerning 
compositions of Gevrey vector- valued functions (as in Proposition A.l in 
[MS02J ) , which would have implied that the diffeomorphisms &j constructed 
in Lemma 13.31 are in fact (a, Lj)-Gevrey, with an estimate on their distance 
to the identity. However, such a statement is not very elegant to state and 
it is not needed here. 

2. Then, in the proof of Proposition 13.41 we used the following lemma 
which enabled us to bound the Gevrey norm of the derivatives of a function 
in terms of the Gevrey norm of the function. 

Lemma A. 2. Let p > and g G G a,L (V p ). For p G N, we have 

\&9\a,L/2 < \9\a,L- 

zgn 2 ™, \i\=p 

For an easy proof and a possible implicit constant, we refer to Lemma 
A.l, Appendix A.l in [MS02| . 

3. Finally, we shall also need corresponding estimates for the C k norms, 
which are well-known and much more easy to prove. 

The above lemma is the analogue of Lemma IA.ll and it is needed in the 
proof of Lemma 13.61 

Lemma A. 3. Let < p' < p, suppose that g G C k (D p ) and that $ : T> p i — > 
V p is of class C k . If 

|$-Id| fc <-1, 

then / o$ G C k (V p ,) and 

|/°*|fc<|/|fc- 

Finally, here's an easy analogue of Lemma IA.2I which is useful in the 
proof of Proposition 13.71 

Lemma A. 4. Let p > and g G C k {T> p ). For < p < k, we have 

^ \d l g\ k - p <\g\ k . 

ZGN 2 ™, \l\=p 
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